Fréchet Distance Based Approach for Searching Online Handwritten Documents

نویسندگان

  • E. Sriraghavendra
  • K. Karthik
  • Chiranjib Bhattacharyya
چکیده

We propose a novel, language-neutral approach for searching online handwritten text using Fréchet distance. Online handwritten data, which is available as a time series (x,y,t), is treated as representing a parameterized curve in two-dimensions and the problem of searching online handwritten text is posed as a problem of matching two curves in a two-dimensional Euclidean space. Fréchet distance is a natural measure for matching curves. The main contribution of this paper is the formulation of a variant of Fréchet distance that can be used for retrieving words even when only a prefix of the word is given as query. Extensive experiments on UNIPEN dataset1 consisting of over 16,000 words written by 7 users show that our method outperforms the state-of-the-art DTW method. Experiments were also conducted on a multilingual dataset, generated on a PDA, with encouraging results. Our approach can be used to implement useful, exciting features like auto-completion of handwriting in PDAs.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Word spotting for handwritten documents using Chamfer Distance and Dynamic Time Warping

A large amount of handwritten historical documents are located in libraries around the world. The desire to access, search, and explore these documents paves the way for a new age of knowledge sharing and promotes collaboration and understanding between human societies. Currently, the indexes for these documents are generated manually, which is very tedious and time consuming. Results produced ...

متن کامل

Connected Component Based Word Spotting on Persian Handwritten image documents

Word spotting is to make searchable unindexed image documents by locating word/words in a doc-ument image, given a query word. This problem is challenging, mainly due to the large numberof word classes with very small inter-class and substantial intra-class distances. In this paper, asegmentation-based word spotting method is presented for multi-writer Persian handwritten doc-...

متن کامل

Indexing of Handwritten Historical Documents - Recent Progress

Indexing and searching collections of handwritten archival documents and manuscripts has always been a challenge because handwriting recognizers do not perform well on such noisy documents. Given a collection of documents written by a single author (or a few authors), one can apply a technique called word spotting. The approach is to cluster word images based on their visual appearance, after s...

متن کامل

Pen-Based Retrieval in Handwritten Documents

This paper describes techniques for searching in handwritten and handdrawn documents. We assume that more and more handwritten documents will accrue, as pen-based computers like PDA or TabletPC get increasingly popular. In order to manage the anticipated amount of handwritten documents, powerful abilities for searching within the documents are needed. The techniques, which we propose, are well ...

متن کامل

Closed Curves and Elementary Visual Object Identification

For two closed curves on a plane (discrete version) and local criteria for similarity of points on the curves one gets a potential, which describes the similarity between curve points. This is the base for a global similarity measure of closed curves (Fréchet distance). I use borderlines of handwritten digits to demonstrate an area of application. I imagine, measuring the similarity of closed c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007